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BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 



BLACK BORDERS 

TEXT CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT 
ILLEGIBLE TEXT 
SKEWED/SLANTED IMAGES 
COLORED PHOTOS 

BLACK OR VERY BLACK AND WHITE DARK PHOTOS 
GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 
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Target Mining 
Interface 




atica 



Select Your Query Sequence 



Enter PDB accession number (e.g. 1 QMA): jjllfa [ and chain (e.g. B): 



OR 



• Enter one Swiss- Prot accession (e.g. P27504) or GenBank proteinlD (e.g. CAB08761 .1 ): [ 



Select Database 



Release: 



DEVF9 = BPD3 0 




• Iteration Filter PSI-BLAST matches to be excluded: 



Matches detected during the first 20 forward iterations □ 



if you select e.g. "Matches detected during the first 3 iterations" these matches will be excluded from the report 
(using the first_PB_iter annotation). This allows you to focus on more remote homologous which have been 
detected after 4 or more PSI-BLAST iterations. Matches detected using PSI-BLAST with negative iterations or 
using Genome-Threader are not effected by this option. However, if one match is found during the first e.g. 3 
PSI-BLAST iterations and by Genome- Threader it will be excluded. 



• Filter for the following SPECIES 

| DHomo sapiens 



□Rattus norvedcus (Rat) 



□Mus musculus (Mouse) | DDanio rerio (Zebra fish) 



II 



FIG. 2A 
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m\ http: //London-bridge. inpharmatica.co.uk/cgi-bin/volker/getTargetBPD3.pl 



2) 84 additional hits identified by both, Genome Threader and PSI-BLAST: 

Comkiitfd Gtnomf Tbv&dir and PSI - BI&J* outpvt : PSI - BLAST v&Jum art thorn m maroon! 




AAA59544.1 
drill through 
iTooSQBastHits 



iTop50aastHits 



AAA59S44.I 



Red.SegAfew 



AAB24821.1 



drill through 



Rcd.Seg.Vfew 



Q997t5 
dm through 
TooSOBIastHts 



Red.SeqAfov 



P2070I 
drffl through 
iToo50B1astrtts 



Red.Seo.V1ew 



AAC3I672.1 
drifl through 
ITqpSQ Blast Hits 



Red.SeQ.V1ew 



CM72402.1 
drill through 
Top50BtotHit5 



I AAB38702.1 
I drill through 
J j Too50Bla3tHits 

1 Red.Seq.Vtev 



! CAB70853.t 
I dril through 
j J jTocBOBfastHrta 



I Red.Sea.Vtew 



AAB24821.1 



099713 



mm 



AAC31672.1 



CAA72402.1 



AAB38702.1 



CAB7W53J. 



\ W27972.1 
| dril through 
J ;Too50BlastHits 

Red.Seo.VW 



CAA27972.1 



AAB595I2.1 



dril through 




CAA07569.1 



AAB59312.1 



Not given 



leukocyte bitegrfo alpha chain 



COLLAGEN ALPHA t(Xll) 
CHAIN PRECURSOR. 



LEUKOCYTE ADHESION 
GLYCOPROTEIN IFA-I ALPHA 
CHAIN PRECURSOR 
(LEUKOCYTE FUNCTION 
ASSOCIATED MOLECULE 1 f 
ALPHA CHAIN) (CDMA) 
(1NTEGRINALPHA-L). 



leukocyte furetion-awodated 
molecule -I alpha subunrt 



cofegen type XIV 



cartfege matrix protein 



hypothetical protein 



Not given 



Not given 



Homo 
sapiens 



nomo 
sapiens 



Homo 
sapiens 
(Human). 



Homo 
sapiens 
(Human). 



■ i 

nomo 

sapiens 



1 1 

nomo 

sapiens 



Homo 
sapiens 



Homo 
sapiens 



1 1 

homo 

sapiens 



PRI 



33.7%. 33% 
kjnmaskedSW 



Query 
ran. 
(CiT'PSI) 


Tvaet rgn. 
(GT,PS!) 


m. 
ocore 
(GT) 






r 


4-183, 
4-183 


150-336, 
! 150-336 


i 449 




PRI 



133.7%. 33% 
kjnmaskedSW 



|28JL26% 
m kjnmaskedSW 



PRI ! 



hoot. 100* 



PRI 



199.5%. 99% 



PR| |29J%,29% 
r ™ kjnmaskedSW 



PRI b]7%,27% 
rhl kjnmaskedSW 



pm |28%,28% 
rm kjnmaskedSW 



4-183, 
4-183 



| 100% 
j unmaskedGT 



150-336, 
150-336 



4-181, 
2-174 



140-318, 
2321-2495 



1-183, 
1-183 



1-183, 
1-183 



2-10), 
2-180 



4-183, 
2-182 



449 



I 100% 
iunmaskedGT 



440 



100% 
^jnmaskedGT 



153-335, 
153-335 



423 



100% 
lunmaskedGT 



153-335, 
153-335 



5-185, 
5-185 



275-455, 
39-223 



1-180, 
1-183 



PRI t 



120.5%. 20% 



kjnmaskedSW 



drill through 



Too50aastffis 



CAA07M9.1 



matriSn-4 



Homo ; pR| 30.5%. 20% 
sapiens I kjnmaskedSW 



Homo 
sapiens 



3-183, 
2-181 



3-183, 
2-181 



I PRI '( 



128.1%. 25% I 1-183, 
kjnmaskedSW j 1-183 



437-620, 
437-624 



423 



10X1% 
HinmaskedGT 



422 



1043% 
unmaskedGT 



413 



406 



100% 
4jnmaskedGT 



100% 
'unmaskedGT 



1497-1673, 
1689-1873 



758-934, 
950-1134 



342-528, 
31-217 



405 



100% 
j unmaskedGT 



405 



403 



100% 

iunmaskedGT 



100% 
UnmaskedGT 



-4 
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Forward' . Reload ,-^Hra>e Snatch ' Netscape ^ ^ Print , Security 



^^^^^Bj^r^k^; : ^L:^5^fc|http: //www. Sanger, ac. uk/cgi-bin/Pfam/nph search.cgi 



c The Pfam 

^ » ^^§®** Protein families database of alignments and HMMs 

centre 

Home | Keyword search | Protein search | DNA search | Browse Pfam | Taxonomy search | Help 




Results for gi|1788084|gMAAC74854.1| 

There wore no matches to Pf am-A (including borderline matches) for gill 788084 1 gb I AAC74854 .1 1 

Matches to Pfam-B 



Damabi 


Start |£nd j Evahte {Alignment 


!Pfam-B 39416 


233 (423 pe-103 (Mgn 



~J [427 xcsidues] 



Alignments of Pfam-B domains to best-matching Pfam-B sequence 



Format for fetching alignments to Pfam-B families Hypertext linked to SWiSSpfam □ 



Query gill788084lgblAAC74854.il/233-423 matching Pfam-B 39416 

YEAH_ECOLI 233 DLRYKHYEKRPDP S S QAVMF CLMDV S G SMD Q STKDM AKRFYILL YLFL SR 282 
DLRYKH YEKRPD P S S QAVMF CLMDV S G SMD Q STKBM AKRFYI LL YLFL SR 
gi| 1788084| gl>| AAC74854. 1| 233 DLRYKHYEKRPDP S S QAVMF CLMDV S 6 SMD Q STKDM AKRF YILL YLFL SR 282 

YEAH.ECOLI 283 TYKHVE VVYIRKHT Q AKE VDEMEFF Y S QET G GTIV S S ALKLMDEVVKERY 332 
TYKNVEVVYIRHHT QAKEVDEHEFFYSQET G GTIV S S ALKLMDEVVKERY 
gi| 1788084| gb| AAC 74854 . 1| 283 TYKHVEWYIRKKT Q AKE VDEHEFFY S QET G GTIV S S ALKLMDEVVKERY 332 

YEAH_ECOLI 333 HP AQfWNIY AAQ ASD GDHWADD SPL CMEILAKKLLP VVRYY S YIEITRRAK 382 
HPAQWHIYAAQASD GDHWADD SPL CHEILAKKLLPWRYY S YIEITRRAK 
gi| 1788084| gb| AAC74854.il 333 HP AQWHIYAAQASD GDHWADD SPL CHEILAKKLLPWRYY S YIEITRRAK 382 

YEAH_ECOLI 383 QTLWREYEKL Q STFDHF AM QKIRD QDDI YPVFRELFHKQH A 423 
QTLWREYEKL Q STFDHF AM QKTRD QDDIYPVFRELFHKQH A 
gi|1788084|gb|AAC74854.1| 383 QTLWREYEKL Q STFDHF AM QKIRD QDDI YPVFRELFHKQH A 423 



Align to family 



If you think there is anything wrong with this script please contact Pfam 







- - ' I IF 


d31^ 100% = if . -. ■ . ~ .a^rvrr" 
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LOCVS 

DEPIMITIOH 
AC CESSION 
PID 

VERSZ OH 
BB SOURCE 
KEYWORDS 
SOURCE 

ORGRHI SM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



AAC74854 427 aa BCT Ul-DEC-ZUUU 

orf, hypothetical protein [Escherichia coli Rl2). 

AAC74854 

g!788084 

AAC74854.1 01:1788084 

locus AE000273 accession AE000273 . 1 

Escherichia coli k>12. 
Escherichia coli K12 

Bacteria; pro teobacte ria; gamma subdivision; Enterobac teriaceae; 
Escherichia. 

1 (residues 1 to 427> 

Blattner, F . R - , Plunkett, G. ill, Bloch, C.A., Pema,H.T., Burland.v., 
Riley, M., collado-vides, J . Glasner, J . i> . , Rode,C.K., Mayhew, G.F., 
Gregor,J., Davis, N.W., Kirkpatrick, H. A. , coeden,M.A., Rose^D.J., 
Man, B . and shao,Y. 

The complete genome sequence of Escherichia coli X-12 

science 277 (5331), 1453-1474 (1997> 

97426617 

9278503 

2 (residues 1 to 427> 
Blattner, F.R. 

Direct Submission 

submitted (16-JAN-1997) Gup Plunkett III, Laboratory of Genetics, 
university of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA- 
Email: ecoli£genetics . wise . edu Phone: 608-282-2534 Fax: 
608-263-7459 

3 (residues 1 to 427) 
Blattner, F . R. 

Birect Submission 

Submitted (02-SEP-1997> Guy Plunkett III, Laboratory of Genetics, 
university of Wisconsin, 445 Henry Mall, Madison, WI 53706, us A. 
Email: ecolaX*genetics . wise . edu Phone: 608-262-2534 Fax: 
608-263-7459 

4 (residues 1 to 427> 
Plunkett, G . III. 
Birect submission 

Submitted (13-OCT-1998) Laboratory of Genetics, University of 
Wisconsin, 445 Henry Mall, Madison, WI 53706, USA 

This sequence was determined by the E. coli Genome Project at the 
University of Wisconsin-Madison (Frederick R. Blattner, director). 
Supported by him grants HG 00301 and KG 014 2 8 (from the Human Genome 
Project and NCHGR) . The entire sequence was independently 
determined from E. coli K12 strain M61655. Predicted open reading 
frames were determined using GeneMark software, kindly supplied by 
Mark Borodovsky, Georgia institute of Technology, Atlanta, GA, 
30332 [e-mail: mark@amber.gatech.edu]. Open reading frames that 
have been correlated with genetic loci are being annotated with CG 
Site Nos., unique ID nos. for the genes in the E . coli Genetic 
Stock Center (CGSC) database at Yale university, kindly supplied by 
Mary Berlyn. A public version of the database is accessible 
(http://cgsc.biology.yale.edu). Annotation of the genome is an 
ongoing task whose goal is to make the genome sequence more useful 
by correlating it with other data. comments to the authors are 
appreciated, updated information will be available at the E. coli 
Genome Project's world Wide web site 

(http://www.genetics.wisc.edu). *** The E . coli R12 sequence and 
its annotations are periodically updated; this is version M54. No 
sequence changes. Annotation updates: updated gene identifications 
and products; all new functional assignments courtesy of Monica 
Riley; added promoters, protein binding sites, and repeated 
sequences described in reference 1. The unique numeric identifiers 
beginning with a lowercase 'b' assigned to each gene (protein- or 
RNA- encoding) are now designated as gene synonyms instead of 
labels. This should allow them to be searched for in Entrez as gene 



names. 
Method: 



FEATURES 

source 



Protein 



conceptual translation. 
Location/ Qualif ie rs 
1. .427 

/organism="Escherichia coli K12 M 

/strain=»"Rl2** 

/sub_strain= ,, MGl655" 

/db xref-** taxon: 83333" 

1 . . 4"27 

/function^" orf ; unknown" 

/product*" orf , hypothetical protein** 

1. .427 

/gene="yeaH" 

/coded_by = **1788078: 6385. . 7668** 



/ trans 1_ table =11 
Znote^** o4 27; ThTs 



to 327 residues 
SW: P45742" 



427 aa ORF 
o f an app rox 



is 28 pet identical (4 3 gaps) 
312 aa protein V2DC_BACSU 



1 mtwfidrrin gknksmvnrq rflrrykaqi kqsiseaink rsvtdvdsge svsiptedis 

61 epmfhqgrgg lrhrvhpgnd hfvqndrier p<igggggsgs gqgqasqdge gqdefvfqis 

121 kdeyldllfe dlalpnlkqn qqrqlteykt hragytangv paniswrsl qnslarrtam 

181 taokrrelha leenlaiisn sepaqlleee rlrkeiaelr akiervpfid tfdlryknye 

^■■■..^■^^^^■ri^M^i^^p, ) iii h ^»iiiiii; m i i )ii».i i iii';iii i ii 
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FIG. 9 
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http: //victoria. inpharmatica.co.uk/~volker/BPD3target.html 




Target Mining 
Interface 




atica 



Select Your Query Sequence 



• Enter PDB accession number (e.g. 1 QMA): 



OR 



IAOxJ and chain (e.g.B): a\ 



• Enter one Swiss- Prot accession (e.g. P27504) or GenBank proteinID (e.g. CAB08761 .1 ): 



Select Database 



Release: 



DEVF9 = BPD3 □ 



Apply Filters 



• Iteration Filter PSI- BLAST matches to be excluded: 



None 



If you select e.g. "Matches detected during the first 3 iterations" these matches will be excluded from the 
report (using the first_PBjter annotation). This allows you to focus on more remote homologous which have 
been detected after 4 or more PSI-BLAST iterations. Matches detected using PSI-BLAST with negative 
iterations or using Genome-Threader are not effected by this option. However, if one match is found during 
the first e.g. 3 PSI-BLAST iterations and by Genome-Threader it will be excluded. 
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FIG. 12 




$ c The Pfam 

k danger Protein families database of alignments and HMMs 

eil tre Home | Keyword search| Protein search | DNA search | Browse Pfam | Taxonomy search| Help 




Results for ^|2367274|gblAAC76768.1| 



There were no matches to Pf am- A (including borderline matches) lor gi!2367274lg)>IAAC76768.1l 

Matches to Pf era- B 



[ Domain [Start [End j Evafee jASpnrctfj 
jPfam-B 1S204 W'W j2.4e-108 jMga | 



[427 rettics] 



Alignments of Pfam-B domains to best-matching Pfam-B sequence 

Format for fetching alignments to Pfam-B families: [Hypertext linked to SWISSpfaiTl □ 



Query giB367274lgbIAAC76768.il/204-408 matching Pfam-B 15204 

YIEM ECOLI 204 ttnJU^PEIJglCITEI^ETO^ 253 
DHR1XPPELAT1 CITELmFYKayn^QLLTYRLKCg WRg xm ppv 
gi| 23672741 gfc|AAC76768.1| 204 DI1RLLPPELATL eimmFYWlLVEKQLLTYRLKW SVREOTERPV 253 

YIEM ECOLI 254 VKKDYDE QPRCPFIY SSSMCGFRZQ CAKAP CLALMKTJiLAERRRCY 303 

?KmDZQPR6Pmc?i)T5esH66rjffQCAWciJUjnajajajn»w 

ji| 2367274| gb|AAC76768. 1| 254 VWO)YDEQPR6PmCVI)T565»66FHEQCAlOTCIJ^^ 303 

tin ECOLI 304 IWJSTEIYRra.S6PQGirQflimSQQrR66TDlJlSCniAIMna.Q5n 353 
IWJ , STEIVRraS6PQ6IEQflIPJL5QQ7R66TDlJlSCyRJUM^ 
gi| 2367274 |gb|AAC76768.1| 304 IiaTSTrrVRmS6PQCKQflIWLSQQm6TDUUCypjmimQSn 353 

YIEM EC0LI 354 EOTDADAVYI 5DPTAQRLPDDVT SKYKE1 QRVXQKRFHRV AM SAX6KP 61 403 
EOTDADAVYI SDPIAQRLPDDVT SKVKZL QRVHQKRPKAVAMSAK6KP 61 
gi| 23672741 gb|AAC76768.1| 354 EOTDADAVVI5DrrAQPXPDD?TSl^VFELQRVMQHR7>{AVAMSAK6KP6I 403 

YIEM ECOLI 404 MPXFD 408 

warn 

gi| 2367274| gb | AAC76768. 1| 404 MMTO 408 



Align to family 



If yon dunk there is anything wrong with this script, please contact Pfam 
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REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
MED I. IRE 
PUB MED 
REFEREACE 
AUTH ORS 
TITLE 
JOURAAL 



REFERERCE 
AUTHORS 
TXTLE 
JOURAAL 



REFERERCE 
AUTHORS 
TXTLE 
JOURAAL 



LOCVS AAC76768 427 a* bct 01-DEC-2000 

DEPTATTIOA orf, hypothetical protein ( Escherichia coli XI 21 
ACCESSION AAC76768 
PXD g2367274 

VERSIOR AAC76768.1 CI : 2367274 

DBS OUR CE locus AE 00 04 51 accession AE 00 04 SI , l 

KBW0RD5 

sovrce Escherichia coli K12. 

ORGAAISM Escherichia cpli 

Baoteria; Proteobacteria; gamma subdivision; Enterobacteriaceae; 
Escherichia. 

1 (residues 1 to 427) 

Blattner,P.R Plunkett, c. Ill, Bloch, C.A., Perna,A.T., Burland,*., 
Rilep,M., Collado-vides,J., Glasner,J D., Rode,C.R., tsayhew, G P 
Crcgor,J. Davis, WW., Kirkpatrick,H. A. , Coeden,M.A., Rose, B.J./ 
Mau, B . and Shao,Y. ' 
The complete genome sequence of Escherichia coli x-12 
Science 277 (5331), 1453-1474 (1997) 
97426617 
9278503 

2 (residues 1 to 427) 
Blattner, F. R . 
Dixect Submission 

submitted (16-JAW-1997) Gup Plunkett XXX, Laboratorp of Genetics, 
university of Wisconsin, 445 Henrp Ball, Madison, VI 53706, USA. 
Email: e c olidgene tic s.w is c. edu Phone: 608-262-2534 Fax 
608-263-7459 

3 (residues 2 to 427) 
Blattner, P. R. 
Direct submission 

Submitted (02-SEP-1937) cup Plunkett XXI, Laboratorp of Genetics, 
university of Wisconsin, 445 Henrp Mall, Madison, VI 53706. USA 
Email: ecolidgenetics.visc.edu Phone: 608-262-2534 Fax- 
608-263-7459 

4 (residues 1 to 427) 
Plunkett, G. XXI. 
Direct Submission 

Submitted (13-0CT-1998) Laboratorp of Genetics, University of 
Wisconsin, 445 Henrp Mall, Madison, WI 53706, USA 
This sequence was determined bp the E. coli Genome Project at the 
University of Wisconsin-Madison (Frederick R. Blattner, director). 
Supported bp AIM grants KG00301 and KG01428 (from the Human Genome 
Project and BCHCR) . The entire sequenoe was independently 
determined from E. coli K12 strain MG1655. Predicted open reading 
frames were determined using GeneMark software, kindlp supplied bo 
5££o B ? rodo ^ kp ' ?2J ra £* institute of Technology, Atlanta, OA, 
30332 (e-mail: markSamber. gate ch. edu j . open reading frames that 
have been correlated with genetic loci are being annotated with CG 

ff^VS*'! ^SK^F^*?*^ for ia the B. coli Genetic 

Stock center (CGSC) database at *ale university, kindlp supplied bp 
Marp Berlpn. A public version of the database is accessible * 
(http://cgsc.biologp.pale.edu). Annotation of the genome is an 
ongoing task whose goal is to make the genome sequence more useful 
bp correlating it with other data. Comments to the authors are 
appreciated. Updated information will be available at the E. coli 
Genome Project's World wide Web site 

(http://www genetics. wise edu). *** The E . coli Kl2 sequence and 
its annotations are periodically updated; this is version M54 Do 
sequence changes. Annotation updates: updated gene identifications 
and products; all new functional assignments courtesy of MonXca 
Riley; added promoters, protein binding sites, and repeated 
sequences described in reference 1. The unique numeric identifiers 
beginning with a lowercase 'b' assigned to each gene (protein- or 
AAA-enooding) are now designated as gene synonyms instead of 
names 5 ' **<"* Ad allow them to be searched for in Entrez as gene 

Method: conceptual translation. 
Location/ Qualifiers 

1..427 

/organism*" Escherichia coli *12 m 
/strain»-Kl2" 
/sub_strain*-M©1655- 
/db xref="taxon: 83333" 
1. .427 

/function**" or f; unknown* 
/produot="orf , hypothetical protein" 
CDS 1..427 

/gene*"pieM < * 

/ code d_bp«* complement (236 7272: 5243. . 6532)" 
/ trans 1 table = 11 

/note«"l427; sequence change joins ORFs pieD and pieM from 
earlier version" 

cn 

1 mrsrikdary ppclteevmc pqqsqllstp qfivqlpqil dllhrlnspw aeqarqlvda 
61 nstitsalht Iflqrwrlsl ivqattlnqq lleeereqll sevqermtls gqlepiladn 
11 ntaagrlwdm sagqikrgdy qlivkpgefl neqpelkrla eqlgrsreak siprndaqme 



FEATURES 

source 



Protein 



1 ml* 



181 tf rtmvrepa tvpeqvdglq 



- ^ - - - - - - - - — Jiprndaqme 

qsddilrllP Oelatlolte l*P»fnrr>v »Vrr1 1 fri-rlK 
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FIG. 16A 




laox: MG400 
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FIG. 16B 



FIG. 17 
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"""If 


Target Mining 




Interface 
















atica 



Select Your Query Sequence 



• Enter PDB accession number (e.g. 1 QMA): 



OR 



JlJLM and chain (e.g. B): 



Enter one Swiss- Prot accession (e.g. P27504) or GenBank proteinID (e.g. CAB087B1 .1 ): \ 



Select Database 



DEVF9 = BPD3D 



Apply Filters 



• iteration Filter PSI- BLAST matches to be excluded: 



None 



If you select e.g. "Matches detected during the first 3 iterations* these matches will be excluded from the report 
(using the first_PBjter annotation). This allows you to focus on more remote homologous which have been 
detected after 4 or more PSI-BLAST iterations. Matches detected using PSI- BLAST with negative iterations or 
using Genome -Threader are not effected by this option. However, if one match is found during the first e.g. 3 
PSI-BLAST iterations and by Genome-Threader it will be excluded. 

• Filter for the following SPECIES: 



□Homo sapiens 



□Ratios norvecicus (Rat) 



| DMus musculus (Mouse) . | □Danio rerio (Zebra fish) | |Q 



FIG. 18A 
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>FHe^ \ • ft? Communicator | 



Help 



. Rorvyara ^ Reload T r f Hgfr^ T Search , Netscape ^ Flint : . Sectfi^i|| 



S£-^^^|^gk^^j , ^| htt P : //London-bridge. inpharmatica.co.uk/cgi-bin/volker/9etTargetBPD3.pl 



2) 81 additional hits identified by both, Genome Threader and PSI-BLAST: 

Combined Genome Ttaader and PSI - Bl&ft ootpd : PSI - BLAST vahrei m shown in maroon ! 



I3t 

Conf.(GT) Iter . 

(PSI 
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FIG. 20 



File £<JiC Vtew Qo £pmrfM^|c^to^ 



r iHetg 



• Forward 



■ •Reload'' y;Horrte,- ■ - ^Seafch- ^Ni^scap#^ '^-<f%?t^/s;^e^>^> r 



3 



f^g|f| ^Bc^5^^^b|S^§p^ http: //www, sanger. ac. uk/cgi-bin/Pfam/nph-search.cgl 



c The Pfam 

ISanger Protein families database of alignments and HMMs 

V^GlltrG Home | Keyword search | Protein search | DNA search | Browse Pfam | Taxonomy search | Help 




Results for g||133251|sp|P10155|RO60_HUMAN 

There were no matches to Pfam- A (inchxding borderline matches) for gil!33251 Isp IP1 01 55 lRO60_H UMAN 

Matches to Pf am~B 



} Domain 


Start {End j Evahie jAK^Bment 


jPfam-B 8344 


1 1194 !2.3e-103 lAlign 


!Pfam-B 10162 


195 {538 Me-iesjMsa 



] [538x<sidu«] 



Alignments of Pfam-B domains to best-matching Pfam-B sequence 



Format for fetching afignments to Pfam-B families: Hypertext linked to SWisspfam I I 



Query g^H33251lsplP10155IRO60_HUMANrt 

Q92787 1 

gi| 133251 |sp|Pl0155|R060_KUWAH 1 

Q9278? 51 

gi| 1332511 sp|P10155|ROG0_HUWAW 51 

Q9278? 101 

gi|133251|sp|Pl0155|R060_HVMAN 101 

Q92787 151 

gi| 133251 1 sp|PlO15S|RO60_KUMAH 151 



-194 matching Pfam-B 8344 

MEE S VN QM QPLHEKQI ANS QD GYVWQVTDMNRLMRFLCF GSEG 6TYYIKE 5 0 
HEE S VR QM QPLHEKQI AH S QD CYVWQVTDMHRLMRFL CF G SEGGTYYIKE 
WEE SVH QM QPLHEKQI AHS QD CYVWQVTDMHRLMRFL CP GSEG GTYYIKE 5 0 

QKL GLEHAE ALIRLIED GR G CEVX QEIKSF S QE GRTTKQE PMLF AL AI C S 100 
QKL GLEHAE ALIRLIED GRG CEVI QEIKSF S QEGRTTKQEPMLF AL AI C S 
QKL GLEHAEALIRLIED GRG CEVI QEXKSFS QE GRTTKQEPMLF AL AI C S 100 

Q C SDI STKQAAFKAV SEVCRIPTHLFTFI QFKKDLKE SMKC GMWGRALRK 150 
Q C SDI STKQ AAFKAV SEV CRIPTXLPTFI QFKKDLKE SMKC GMWGRALRK 
Q C SDI STKQ AAFKAV SEVCRIPTHLFTFI QFKKDLKE SMKC GMWGRALRK 150 

AI ADWYHEKG 6M AL AL AVTKYKQRR GWSMKDLLRL SMLKP S SE 6 194 
AI ADWYHEKG CM AL AL AVTKYKQRH GWSMKDLLRL SKLKP S SE 6 
AI ADWYHEKG 6MALAL AVTKYKQRR GWSMKDLLRL SMLKP S SE G 194 



| Align to family 



31 



Query gill 33251 I sp IP1 01 55 lRO60_H UMAN/1 95-538 matchin g Pfam-B 10162 

008848 195 LAIVTKYITKCWKEVMEEYKEKAL 5VEAEKLLKYLE AVEKVKRTKDDLEV 244 
LAIVTKYITKGWKEVME YKEKAL5VE EKLLKYLE AVEKVKRTKDDLEV 
gi| 133251 |sp|Pl01S5|R060_KVWAH 195 LAIVTKYITKCWKEVMELYKEKALS VET EKLL KYLE AVEKVKRTKDELEV 244 

00884 8 245 IHLIEEHQLVREHLLTHMLKSKEVWKALLQEMPLT ALLRHL GKMTAH SVL 294 
TKI.TRRM+LWRBHLLTHHLKSKEVWKALL OEMPLT ALLRHL G KMT AH SVL 

008848 295 EP GR SEV SLI CEKL SHEKI^KKARIMPFMVLI ALETYRAGMGLRGKLKWI 344 
EPGH SEV SL+ CEKL REKLLKKARIHPPH+LIALETY+ GMGLRGKLKV 
gi|l 33251 |sp|P 10155 |R060_KUMAH 295 EP GH SEV 5LV CEKL CHEKLLKKARIMPFHILIALETYKT GMGLRGKLKWR 344 

008848 345 PDKDIL QALDAAFYTTyKTVEPT GKRFLLAVD VS ASMH QRAL G SVLHAST 394 
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FIG. 21 



£|| -Edit/ $tfew .jjio Communicator : ; ; : - ..." \ 


'7 . Help: 


yj f : PiggS' ^Forward 'Reloads 4-1 ome Search Netscape : Print ; 'j< Security ', '.ffc^; 






§f "^|T'B!$#^Mfe ;^^t6c|tiei|| http: //www.ncbi.nlm.nih.gov:80/entre2/queiyJcgi?cmd=Retrieve&db==Prot §g|j|j 



LOCUS 

DEFINITION 

ACCESSI OH 
PID 

VERSION 
DB SOURCE 



KEYWORDS 



SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
REMARK 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REMARK 
COMMENT 



ORIGIN 



ROSO HUMAN 538 da PRI 01-FEB- 1996 

60 KD RO PROTEIN <60 KD RIB ONUCLE OPROTEIN R0> <RORNP> (SJOGREN 
SYNDROME TYPE A ANTIGEN (SS-A>). 
P10155 
g!33251 

P101S5 61:133251 

swissprot: locus RO60_HUMAN, accession P10155; 
class: standard, 
created: Mar 1, 1989. 
sequence updated: Mar 1, 1989. 
annotation updated: Feb 1, 1996. 

xrefs-. gi: gi: 177782. gi: gi: 177783. gi: gi: 387656. gi: gi: 
387657. gi: gi: 86722. gi: gi: 107626 

xrefs (non-sequence databases): MIM 600063. mim 2 34 7 0 0 . PROSITE 
PS00030 

Ribonuc leoprote in; RNA-binding; Sys temic lupus erythematosus; 
Antigen, 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Crania ta; vertebrata; Mammalia; 
Eutheria; primates; Catarrhini; Hominidae; Homo. 

1 (residues 1 to 538> 

Deutsche r, S . L . , Marley,J.B. and Keene,J.D. 

Molecular analysis of the 60-kDa human Ro ribonucleoprotein 
Proc. Natl. Acad. Sci. U.S.A. 85 (24), 9479-9483 (1988) 
89071722 

SEQUENCE FROM N.A 

2 (residues 1 to 538) 

Ben-Che trit, E . , Gandy,B.J., Tan, E . M . and Sullivan, K. F . 
isolation and characterization of a cDNA clone encoding the 60 -kD 
component of the human SS-A/Ro ribonucleoprotein auto antigen 
J. Clin, invest. 83 (4), 1284-1292 (1989) 
89198084 

SEQUENCE FROM N.A. 

This SWISS-PROT entry is copyright, it is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available f rem http :// www. ex pa sy . ch/ sp rot 
and http://www.ebi.ac.uk/sprot 

I FUNCTION J UNKNOWN . 

isUBUNITj RO SMALL RIB ONUCLE OPROTEIN S CONSIST OF FOUR SMALL RNA 
MOLECULES OF 85-112 NT, EACH OF VKICM IS COMPLEX ED WITH A 60 KD 
PROTEIN. RO RNPS MAY ALSO CONTAIN AN ADDITIONAL 52 KD PROTEIN. 
[SUBCELLULAR LOCATION] CYTOPLASMIC . 

[DISEASE] SERA FROM PATIENTS WITH SYSTEMIC LUPUS ERYTHEMATOSUS 
OFTEN CONTAIN ANTIBODIES THAT REACT WITH THE NORMAL CELLULAR RO 
PROTEIN AS IF THESE ANTIGEN WAS FOREIGN. 

[ SIMILARITY ] CONTAINS 1 RNA RECOGNITION MOTIF (RNP) . 
[SIMILARITY] STRONG, TO XENOPUS 60 KD RO PROTEIN. 
Location/ Qualifiers 
1. .538 

/ organism^ " Homo sapiens" 
/db xref =-taxon: 9606" 
1. .^38 

Protein 1 . . 538 

/product="60 KD RO PROTEIN" 
Region 93. .98 

/ re g ion_n am e = M D omain** 

/no te = " RNA-BINDIN 6 (RNP 2) (BY SIMILARITY) . " 
Region 124. 131 

/ re gion__name Domain** 

/no te= M RNA-BINDIN G (RNPl) (BY SIMILARITY)." 

Region 239 

/ re gionjname - m c onf lie t" 

/note = "K — > R (IN REF. 2) . " 
Region 515.. 538 

/ re gion_name=" c onf lie t" 

/no te - ** GMLDM C GFDT G ALDVIRNFTLDMI -> ALQNTLLNKSF (IN REF. 
2).- 



FEATURES 

souroe 



// 



1 meesvnqmqp lnekqiansq dgyvwqvtdm nrlhrf lef g seggtyyike qklglenaea 
61 lirliedgrg ceviqeiksf sqegrttkqe pmlfalaics qcsdistkqa afkavsevcr 
121 ipthlftfiq fkkdlkesmk cgmwgralrk aiadwynekg gmalalavtk ykqrngwshk 
181 dllrlshlkp sseglaivtk yitkgwkevh elykekalsv etekllxyle avekvkrtkd 
241 elevihliee hrlvrchllt nhlkskevwk allqemplta llrnlgkmta nsvlcpgmse 
301 vslvceklcn ekllkkarih pfhilialet yk tghglrgk lkwrpdeeil kaldaafykt 
361 fktveptgkr fllavdvsas mnqrvlgsil nastvaaamc mwtrtekds ywafsdemv 
4 21 pcpvttdmtl qqvlmamsqi paggtdcslp miwaqktntp advfivftdn etfaggvhpa 
4 81 ialreyrkkm dip ak live g mtsngftiad pddrgmldmc gf dtgaldvi rnf tldmi 
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FIG. 22A 
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